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Abstract 

Consider a normally distributed response variable, related to an explanatory variable through 
the simple linear regression model. Data obtained on the response variable, corresponding to 
known values of the explanatory variable (i.e., calibration data), are to be used for testing hy- 
potheses concerning unknown values of the explanatory variable. We consider the problem 
of testing an unlimited sequence of one sided hypotheses concerning the explanatory variable, 
using the corresponding sequence of values of the response variable and the same set of cali- 
bration data. This is the situation of multiple use of the calibration data. The tests derived 
in this context are characterized by two types of uncertainties: one uncertainty associated with 
the sequence of values of the response variable, and a second uncertainty associated with the 
calibration data. We derive tests based on a condition that incorporates both of these uncer- 
tainties. The solution has practical applications in the decision limit problem. We illustrate 
our results using an example dealing with the estimation of blood alcohol concentration based 
on breath estimates of the alcohol concentration. In the example, the problem is to test if the 
unknown blood alcohol concentration of an individual exceeds a threshold that is safe for driving. 
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1. Introduction 

The problems addressed in this article are in the set up of a linear regression model where 
normality is assumed for the dependent variable. Thus let y x , y 2 , •••, Vn be N independent 

observations on a response variable corresponding to the values x x , x 2 , ■■■, respectively, of 

an explanatory variable, where, 

Vi ~ N(a + 0x jt a 2 ), l 11 ) 

j — 2, ..., N. The z/s are assumed to be non-random quantities. Now let yo be another 

observation corresponding to an unknown value, say 9, of the explanatory variable, where y 0 is 
independent of the y/s in (1.1). Thus we have 

y 0 ~ N(a + (39,o 2 ). (1-2) 

The problem of calibration, or inverse regression, deals with statistical inference concerning 6. 
The set of y/s in (1.1), corresponding to the known values {j = 1, 2, ..., N), is referred to 
as the calibration data. The relationship between the y/s and the corresponding Xj 's is known 
as the calibration curve. We are thus dealing with the situation where the calibration curve is 

a straight line. . 

Most of the available literature on the calibration problem deals with point estimation of 9, 
or the construction of confidence regions for 9. The purpose of this article is to address some 
hypothesis testing problems concerning 9. More specifically, we shall consider the following 
testing problems: 

(i) H 0 : 0 > c vs Hi : 9 < c, (ii) H 0 : 9 < c vs Hi : 9 > c, (1.3) 

where c is a known number, specified in advance. Our investigation of this problem was moti- 
vated by the following application. 

Estimation of blood alcohol concentration - an example 

This example deals with the calibration of breath estimates of blood alcohol concentration, 
based on the results of a laboratory test. Here, the y/s are breath estimates of blood alcohol 
concentrations and the xfs are the actual concentration of alcohol in the blood, obtained by 
a laboratory test. The relevant data, given in Section 3, are based on a study conducted at 
Acadiana Criminalstics Laboratory, New Iberia, Louisiana. It turns out that the model (1.1) 
provides an adequate fit for the data. We can use the models (1.1) and (1.2) in order to estimate 
an unknown blood alcohol concentration 9 , after obtaining the corresponding breath estimate 
y 0 . Such an estimation is preferable to the actual laboratory determination of the blood alcohol 
concentration, since it is much easier and faster to obtain the breath estimates. In this context, 
we would also like to test whether an unknown blood alcohol concentration 9 is less than or 
equal to 0.10 or if it exceeds 0.10, since the legal maximum limit of blood alcohol concentration 
while driving is 0.10% in many states in the USA (in some states it is 0.08%). This is precisely 
the testing problem (ii) in (1.3). 

The hypothesis testing problems in (1.3) will also arise in the context of decision and de- 
tection limit problems in bioassays and chemical assays. Suppose we want to determine the 
concentration of a chemical in a sample where a direct determination of the concentration may 
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be difficult or expensive. On the other hand, it may be easier to obtain an indirect measurement, 
for example, spectral measurements. First, several samples having known concentrations (the 
x-'s) are prepared, and the corresponding indirect measurements (the %’s) are obtained. As- 
suming that the relationship (1.1) holds, we can use this data for statistical inference concerning 
an unknown concentration, say 9, in a sample, after obtaining the corresponding indirect mea- 
surement j/o • In this context, the testing problems in (1.3) will arise if we want to verify whether 
the unknown concentration is above a threshold, or if it is below. This is precisely the decision 
limit problem. We refer to Currie (1988) for a detailed discussion of this problem as it arises 
in analytical chemistry. The testing problems (1.3) in the context of bioassays are addressed, 
for example, in Dunne (1995). A good discussion of decision and detection limits, along with 
environmental applications, can be found in Gibbons (1994, Chapter 5). Other examples and 
applications are given in a number of books and articles; see the book by Brown (1993) and the 
article by Osborne (1991) for a review. 

For the models (1.1) and (1.2), appropriate tests for the hypotheses in (1.3) can be derived 
under two different situations. The first is when the calibration data are used only once for 
the purpose of testing hypothesis. In the context of the chemical example mentioned above, 
this means that the calibration data are collected and used to test hypothesis concerning an 
unknown concentration of a single sample. A more realistic scenario is one where the same 
set of calibration data are used repeatedly in order to test hypotheses concerning a sequence 
of 0— values, one at a time, after observing the corresponding sequence of responses. This is 
the situation of multiple use of the calibration data. In other words, we have a sequence of 
independent responses y 0i , corresponding to a sequence of 6 values 6i, following the model 

Voi ~ A r (a -I- ,69,, a 2 ), (1-4) 

similar to (1.2), i = 1, 2, 3, .... Correspondingly, we have the following sequence of hypotheses 
to be tested. 

(i) Ha. : 0j > Ci vs Hu : 9i < c,, (ii) Ho, : 9, < Ci vs Hu : 9, > c,:, (1-5) 

where the q’s are known scalars. For instance, in Example 1, we want to verify if for every 
individual tested, the blood alcohol concentration is no more than 0.10, or no more than 0.08, 
where the value 0.10 or 0.08 is to be used depending on the legal maximum limit in a state. In 
other words, we have hypotheses of the type (ii) in (1.5), where the q’s can take two different 
values, 0.10 and 0.08. In the decision limit problem, we may be interested in testing if the con- 
centration of a chemical or a pollutant in different samples exceeds a particular safety threshold. 
Such tests may have to be done a large number of times, as and when a sample is obtained. In 
other words, we need to test a sequence of hypotheses of the type (i) or (ii) in (1.5). It is also 
clear that in many applications, the c t ’s in (1.5) will have a common value. 

Tests that we shall derive for testing the sequence of hypotheses in (1.5) will have two types 
of uncertainty statements associated with them. One with respect to the sequence of responses 
?/o, in (1.4), and a second uncertainty statement with respect to the calibration data, i.e., the 
2 /j’s in (1.1). This is formally explained in Section 2. The distinction between single use and 
multiple use of the calibration data are also made for the construction of confidence regions. In 
fact confidence regions that involve multiple use of the calibration data are derived subject to 
two types of uncertainty statements, similar to those mentioned above. We refer to the recent 
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article by Mee and Eberhardt (1996) for an excellent discussion on this. The results in Dunne 
(1995) deal with the set up of single use of the calibration data. In fact Dunne (1995) provides 
solutions to the testing problems in (1.3) in the single use situation. Consequently, in this article, 
we shall study the problem for the case of multiple use of the calibration data. 

In the next section, we shall derive the tests. The two uncertainty statements, mentioned 
earlier, are explicitly stated and tests are derived subject to these. In our derivations, we have 
assumed that p > 0 in (1.1). Essentially, this amounts to assuming that the sign of P is known. 
This is clearly a reasonable assumption since the nature of the dependence of the response vari- 
able on the explanatory variable will be known in actual applications. In case P < 0, we can 
assume that the slope in (1.1) is -/3, after multiplying ij 3 by -1. In fact, the tests (i.e., decision 
limits) derived in Dunne (1995) use the fact that the sign of 3 is known. In Section 3, we apply 
our results to the alcohol concentration data, mentioned in example 1. Some concluding remarks 

appear in Section 4. 


2. The Test 


Among the two hypotheses in (1.5), here we shall consider only the sequence (i). Once we derive 
tests for testing (i), similar results can be obtained for testing the sequence (ii) in (1.5). 

Let a and P denote the least squares estimators of a and ft based on the y/s in (1.1), and 
let a 2 denote the unbiased estimator of a 2 based on the residual sum of squares. Then 




a 


y — fix, and 


d 2 = 


1 

N - 2 


N 

j=l 


a — fixjY 


( 2 . 1 ) 


where y and x denote the averages of the y/s and the z/s, respectively. As pointed out in the 
introduction, we assume that P > 0. 


2.1. Motivation of the test statistic 

Suppose q, P and a 2 are known. Then, from (1.2), 


e = 


yo - q 

P 




Hence, we shall reject H 0 : 6 > c in favor of H x \ 6 < c if is small, or equivalently, if 


—3c 


is small. In order to arrive at the latter conclusion, we have used the expression for 0 (given 
above), along with our assumption P > 0. In practice, usually a, P and a 2 are unknown, and 
we shall replace them by their estimators. Thus let 


T _ yo- a- Pc 

a 

Our test for Hq. 0 > c will reject Hq in favor of H\. 0 < c if T is small. Hence, for testing the 
sequence of hypotheses (i) in (1-5), consider the test statistic 


Ti = 


Voi - ® 


Pc,. 


( 2 . 2 ) 
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We shall reject H 0i : 9, > c, in favor of H u : 9i < c, if 

Ti < k h ( 2 - 3 ) 

i = 12 , 3, where the kf s are to be determined. Note that the test statistic 7j depends 
on the response y 0l and also on the calibration data y jt j = 1, 2, .... N, through a, (3 and a. 
Furthermore, (2.3) represents a sequence of rejection regions, for i F 2, 3, 


2.2. Criterion for the derivation of the test 

The derivation of our test amounts to the computation of the kf s in (2.3). We shall now explain 
the condition to be satisfied by the kf s. Since the calibration data will be used repeatedly for 
testing a sequence of hypotheses, consider the proportion of Hoi s that are rejected, when they 
are true, conditionally given the calibration data, i.e., conditionally given a, (3 and a. In other 
words, conditionally given d, /5 and a, consider the proportion of times T { < k u under H 0i : 
g > i = i 2 . 3, .... The kf s are to be determined so that with a high probability, the 
calibration data will guarantee that this proportion is small (i.e., less than a specified bound), 
for all 0fs under the corresponding null hypothesis H 0l . Note that given a, [3 and a, 

r cl — ct T /3 6 i f3ci (j 

Ti~ N T , T3 ■ 


Hence, conditionally given a , and a , Ti < ki is equivalent to 


ZOi < 



a — ot T (39j (3ci 
a 


a 

J 

a 


(2.4) 


where z 0i ~ N( 0, 1). We shall now give an expression for the proportion of H 0l that are rejected 
(i.e., proportion of times Ti < ki), conditionally given d, /? and <5\ Let 


A, = 


1, if Ti < h 
0, otherwise. 


(2.5) 


Conditionally given a, (3 and a, the Ads are independent Bernoulli random variables with 
success probability, say p,;, given by 


p i — P{T < ki 


aJ,a) = HkiZ~- 


a + (39 t — (3 c, 




(2.6) 


where $ denotes the standard normal cdf. In order to arrive at (2.6), we have used the fact that 
conditionally given a, (3 and d, T t < h is equivalent to (2.4). If a sequence of n tests are carried 
out, then the proportion of times Ti < k t is £ X)”=i A,;. By the strong law of large numbers, we 


get 


1 n 


n 


1 n 

Ep*. 


t= l 


n 


i=i 


as n becomes large. In other words, conditionally given a, (3 and a, the proportion of H 0i that 
are rejected is ^H"=iPu when n is large. We w r ant this proportion to be small, w r ith a high 
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probability, for all 9 l under the corresponding null hypotheses H 0i : 9, > c, : , i - 1, 2, 3, .... That 
is, the kj’s must satisfy 

=7, (2-7) 


P ' 

r a,{3,cr 


1 n 

-E^< 


n f=i 


for all Oi under the corresponding null hypotheses HqP 9 t > c,;, i — 1, 2, 3, ...., n, where e is 
a specified bound (say, e = 0.05) and 7 is the chosen probability (say, 7 = 0.95). Due to our 
assumption (3 > 0, it follows from the expression for pi in (2.6) that p l is maximum under Hoi- 
9 t > Ci when 0i = c t . Hence (2.7) holds for all 9 t satisfying 9 t > d if and only if it holds for 
9 Z — Ci . Using expression for p,; in (2.6), we thus get the following condition to be satisfied b> 

the hi s: 


P. 




n , o 


a a — Q + (P — /3)c; 


1=1 


a 




= 7- 


( 2 . 8 ) 


The condition (2.8) can be interpreted as the “size of the tests” being small, i.e., less than or 
equal to e, with probability 7. We note that (2.8) must actually hold for every n. 

Thus the criterion to be used for the derivation of the test, i.e., the computation of the ki s, 
is the condition (2.8), which must hold for all n ^ 1. As is clear from the derivation of (2.8), 
this condition reflects the fact that the same calibration data are used a large number of times 
for testing hypotheses. The fact that the same calibration data are used repeatedly is also the 
reason why we don’t want to consider the unconditional distribution of the T) s in (2.2). 


2.3. Computation of the ki ’s 
Note that 

a — a + (P — fyd = (a + (3x — y) + {(3 — — x). 

Define 


Z\ = [ Yfx, — x) 2 ] 1//2 (/? — /3)/cr ~ N (0, 1), Z 2 — \fFj{y — a — Px)/a ~ A^(0, 1), 
j = 1 

u2 = ( N - 2)^2 ~ Xn- 2> and _ l)2]i/2- (2 - 9) 

The random variables z\ y Z 2 and u in (2.9) are independently distributed, and \ r denotes the 
central chisquare distribution with r df. The ki that we compute will be a function of c u given 
in (2.9). Hence from now on we shall use the notation k(cn) instead of ki. Using the quantities 
in (2.9), (2.8) simplifies to 


P, 


Z 1 ,22 


-£*{*(<*) 


U 


n 


£=i 




+ Z\C\i + 


z 2 

Vn 


}<^ 


= 7- 


(2.10) 


It appears difficult to obtain k(cu ) explicitly, except in the special case when the Cu s are 
all equal. We shall first consider this special case. From the expression for Cu, given in (2.9), it 
follows that the c u ’s are all equal if and only if the Cj’s are all equal. As already pointed out, 
the cf s will be equal in many practical applications; see the discussion following (1.5). When 
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the a ' s are all equal, let c denote the common value of the a’s and let c 0 denote the common 
value of the c H ’s. Hence, from (2.9), 


(c-s) 

Co “ 


( 2 . 11 ) 


In this special case, (2.10) simplifies to 


Z i ,Z2,1t 


${k(c 0 ) 


u 


22 


y/TT- 2 


+ 2lC0+ jN ] - e 


= 7> 


( 2 . 12 ) 


or, equivalently, „ 

PzuziAKco) Jjy 1~2 + ZlC ° + yf/ - = 7 ’ ^ 2 ' 13 ^ 

where z(e ) denotes the 100e th percentile of the standard normal distribution. We note that the 
condition (2.12), or equivalently, (2.13), is essentially the condition to be satisfied by one sided 
tolerance intervals. Indeed, we can arrive at (2.12) by starting with the expression for a one 
sided tolerance interval and using the fact that /? > 0. The solution to the one sided tolerance 
interval problem is already known; see Guttman (1970, pp. 87-89). The solution to k(co) is 
given by 

*(<*) = + - 2; ^ (2-14) 

where G- 7 [r; A] denotes the 100(1 - q) th percentile of the noncentral t-distribution with df = r 
and noncentrality parameter A. Thus in the special case where the c, ; ’s are all equal to a common 
value c, fc(c 0 ) in (2.14) satisfies (2.12), where c 0 is given in (2.11). 

When the ci/s are not all equal, it is not possible to obtain k(cn) explicitly. In what follows, 
we shall explore the following possibility. Let k(cu) satisfy 


P: 


Z\,Z2>U 


${k{c u 


u 


22 


2 


+ Zicu + ^ } < e 


= 7- 


(2.15) 


In other words, 


k(cu) 


,1_ 

W 


+ - 2 ; 


z(e) 


Hi + 4) 


1/2 J 


(2.16) 


The expression in (2.16) follows since (2.15) is similar to (2.12). We shall numerically investigate 
whether k{c u ) in (2.16) will satisfy (2.10) for various sequences {c it }. Note that (2.15) is a toler- 
ance interval condition and is quite different from (2.10). In a recent article on the construction 
of confidence regions in the calibration problem, Mee and Eberhardt (1996) have indicated that 
intervals obtained using a tolerance interval condition may satisfy the requirements of multiple 
use confidence regions. Motivated by this, we shall explore whether k(c \ ,;) in (2.16) will satisfy 
(2.10). 


2.4. Numerical results for k(cu ) in (2.16) 

We have carried out some numerical results to check if k(cu) in (2.16) will satisfy (2.10). For 
this purpose, we need to simulate the left hand side (lhs) of (2.10) using /c(cij) given in (2.16), 
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for various sequences {cjj}. Note that when the cn s are all equal, i.e., when the c x s are all 
equal, such a numerical study is not necessary, since, in this case, fc(co) satisfies (2.12), where 
c 0 denotes the common value of the c u ’s. Thus the numerical results that we report here are 
only for the case when the c H 's are unequal. We shall first discuss our choices for the sequence 
{c lt }. When the calibration data are collected based on a carefully designed experiment, then 
the x/s in (1.1) will cover the range of practical interest. Hence it is reasonable to assume that 
the Qi s in (1.4) belong to the range determined by the x/s, i.e., 

min(xj) < 6i < max(x/, (2.17) 

i = l, 2, 3, ....Consequently, when (2.17) holds, the a's in (1.5) should also belong to the range 
of the x/s. In other words, when we test the null hypothesis H 0 , : 6 r > c, : , we are testing if 9 t is 
greater than or equal to a specified value in the range of the x/s. Hence we shall assume 

min(Xj) < Ci < max(x/, 

i = 1,2, 3, .... From the definition of c Xi given in (2.9), (2.18) gives 

min(xj) -x ^ ^ max(x/ - x 

ISfTTWr* - c “ - 

i = 1. 2, 3, .... In particular, (2.19) implies that 

-1 <c lt < 1. 


(2.18) 

(2.19) 

(2.20) 


The actual bounds for c u , given in (2.19), could be much narrower than [-1, 1]. In any case, in 
order to investigate whether k(c u ) in (2.16) will satisfy (2.10), it appears reasonable to restrict 
Cu ' s in the interval [-1, 1]. In our numerical results, we have also considered the wider interval 
[- 2 , 2 ]. 

For various sequences {c H }, i = 1, 2, ...., 10,000, we have simulated the following quantity. 


Z 1 ,Z2jU 


1 


L io, ooo 


10,000 u 

Xj + ZlCl1 + 



( 2 . 21 ) 


based on 10,000 simulations. The IMSL Fortran function subroutine TNIN is used to compute 
k(cu). Note that (2.21) is an approximation to the lhs of (2.10). Since (2.10) holds exactly 
when the ci/s are all equal, (2.10) is expected to hold approximately when the ci/s are nearly 
equal. Thus it is necessary to investigate whether (2.10) will hold when the c u ’s are as unequal 
as possible. In any finite sequence {ci,}, with cu G [—a, a], the c Xl s are as unequal as possible, 
i.e., the variance among them is a maximum, when half of the Cj, s are equal to a and the 
remaining half are equal to a. In this case, (2.21) reduces to 


Z\,Z2,U 




— Z\d + 


^2 

y/N 


} + <h{fc(a) 


U 


#^2 


+ Z\d + 


y/N 


< e 


( 2 . 22 ) 


The following IMSL subroutines are used in the numerical studies: TNIN to compute k(c u )\ 
RNNOA to generate normal variates; and RNCHI to generate chi-square variates. For N = 20, 
e = 0.05 and 0.01, 7 = 0.99, 0.95, 0.90 and 0.85, and a = 2, 1, 0.5 and 0.1, Table 1 gives the 
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simulated values of (2.22), based on 10,000 simulations. When the sequence {c u }, i = 1, 2, ...., 
10.000 consists of 10,000 equispaced values in the interval [-2,2] and [-1, 1], the simulated val- 
ues of (2.21) appear in Table 2. If 10,000 c u ’s are randomly generated according to a standard 
normal distribution, the simulated values of (2.21) appear in Table 3. 


Table la 

The simulated values of (2.22) for N = 20 and e = 0.05, for different values of 7 and a 


a 

0.99 

7 

0.95 

0.90 

0.85 

2 

0.987 

0.936 

0.852 

0.777 

1 

0.991 

0.950 

0.889 

0.826 

0.5 

0.993 

0.959 

0.916 

0.857 

0.1 

0.990 

0.952 

0.902 

0.849 


Table 16 

The simulated values of (2.22) for N = 20 and e = 0.01, for different values of 7 and a 
a 7 


0.99 0.95 0.90 0.85 


2 

0.985 

0.923 

0.848 

0.753 

1 

0.991 

0.938 

0.865 

0.799 

0.5 

0.991 

0.952 

0.885 

0.824 

0.1 

0.992 

0.949 

0.897 

0.857 


Table 2 

The simulated probabilities (2.21) for N = 20 when the sequence {<7,} assumes 10,000 equis- 
paced values in the interval [—a, a] 


[-a, a] 

e 

0.99 

7 

0.95 

0.90 

0.85 

[-2,2] 

0.05 

0.989 

0.954 

0.902 

0.852 

[-1,1] 

0.05 

0.992 

0.951 

0.898 

0.851 

[-2,2] 

0.01 

0.991 

0.954 

0.907 

0.846 

[-1,1] 

0.01 

0.993 

0.947 

0.902 

0.851 


Table 3 

The simulated probabilities (2.21) for N = 20 when the sequence {c lt } is generated based on 

A r (0,l). i = 1, 2, ...., 10,000 


7 

e 0.99 0.95 0.90 

0.85 

0.05 0.989 0.950 0.897 

0.832 

0.01 0.992 0.951 0.899 

0.834 


The simulation results indicate that k(c u ) in (2.16) will satisfy (2.10) reasonably well at least 
in situations where the range for Ci, , given in (2.19), is a narrow interval. As expected, (2.10) is 
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more satisfactorily met for narrower intervals. As already pointed out, it appears quite reason- 
able to assume that -1 < c u < 1. In practice, the range for c u is likely to be much narrower, 
especially when N is somewhat large. This should be clear from (2.19). In terms of meeting 
the condition (2.10), the results are somewhat unsatisfactory in the set up of Table 1, especially 
f or 7 = 0.85. However, Table 1 represents the ‘worst case scenario’, since (2.22) corresponds to 
the situation where half of the c H ’s are equal to -a and the remaining half are equal to a (for 
a = 2, 1, 0.5, 0.1). In other w T ords, among the sequence of hypotheses being tested, half of the 
time we are testing if the parameter is greater than or equal to one extreme of the parameter 
space (i.e., -a) and the other half deals with testing if the parameter is greater than or equal 
to the other extreme (i.e., a). In actual applications, this appears unrealistic. Nevertheless, the 
purpose of reporting the numerical results in Table 1 is to see of (2.10) (i.e, (2.22)) is met in 
this extreme situation. For large values of 7 the results are quite satisfactory, especially when 
a is small. In any case, the numerical results in Table 1-Table 3 provide evidence regarding the 
extent to which (2.16) will satisfy (2.10). The overall conclusion is that in practical applications 
that call for multiple use of the calibration data for testing the hypotheses in (1.5), k(cu) given 
in (2.16) is quite adequate for meeting the requirement (2.10). 

2.5. Testing the hypothesis 1.5 (ii) 

Tests for the hypotheses (ii) in (1.5) can be similarly derived. The rejection regions now take 
the form Tj > ki(cu), where T t is given in (2.2) and Cu is given in (2.9). When the Cu s are all 
equal to c 0 , fci(c 0 ) is given by 

*,(<*) = (jf + 4)'%\N - 2; ( f ( |~T ], (2.23) 

similar to (2.14). Also, ki(c H ) has an expression similar to that of k(cu) in (2.16). 

3. An Example 

A study was conducted at Acadiana Criminalstics Laboratory, New Iberia, Louisiana, to compare 
the breath estimates of blood alcohol concentration with those determined by a laboratory test. 
A sample of 15 subjects was used. In Table 4, we present the breath estimates (y 3 ) obtained 
using Breathalyzer Model 5000 and the results of the laboratory test (xj). These numbers are 
percentages of alcohol concentration in blood. Here we assume that the x/s are accurately 
measured, i.e., the measurement errors are small enough to be ignored, so that the Xj s can be 
assumed to be fixed. A simple linear regression model was fit with y regressed on x based on 
all 15 pairs of observations. The fitted model is y = 0.00135 + 0.958x and the value of R is 
0.93. The normal probability plot of the residuals is reasonably linear and hence the distribution 
of error terms does not depart from a normal distribution. Since in many states in the USA 
the legal maximum limit of blood alcohol concentration while driving is 0.1%, we want to test 
whether 0 < 0.10 or 6 > 0.10. In other words, here we shall consider the situation where the Ci s 
in (1.5) are all equal and equal to 0.10. In this problem, one may want to control the probability 
of concluding 0 > 0.10 when in fact 0 < 0.10. Thus we shall consider the following null and 
alternative hypotheses. 

H o :0< 0.10 vs H 1 :0> 0.10. (3.1) 



(3.2) 


We shall carry out the test using e = 0.05 and 7 = 0.95. From (2.23), 

1 o ,/o , z(0.95) . 

M c o) = {jj + c o) / to.9s[N ~ 2 ; (J_ + c 2)i/2 1' 

We reject H 0 for the i th individual if T» > h(co). 

In Table 4, we give the classical estimates obtained by solving from the fitted model for x, 
and the values of test statistics T t . The T t ’ s are computed using the leave-one-out method; for 
example to compute the point estimate and the value of the test statistic corresponding to iji — 
0.145 in Table 4, the observation (.145,-160) is not used to fit the model. Thus, N — 14, in (3.2). 
Fitting the model by deleting the first observation, we get d = .0007, (3 = .97, c 0 = -0.077, 
Ti = 3.40, and fci(co) = 2.67. Furthermore, fi = = ■ 145 ' g7 4 * * 0007 = .149. This process can be 

carried out by deleting each observation in Table 4. 


Table 4 

Blood Alcohol Concentrations Data, Estimates and Test Statistics 



lh 

Xi 

Xi 

Ti 

fci(co) 

1 

.145 

.160 

.149 

3.40 

2.67 

2 

.156 

.170 

.160 

4.18 

2.67 

3 

.181 

.180 

.189 

6.00 

2.67 

4 

.108 

TOO 

.112 

0.85 

2.68 

5 

.180 

.170 

.189 

6.27 

2.67 

6 

.112 

TOO 

.117 

1.19 

2.68 

7 

.081 

.060 

.086 

-1.11 

2.69 

8 

.104 

TOO 

.108 

0.53 

2.68 

9 

.176 

.170 

.185 

5.78 

2.67 

10 

.048 

.056 

.047 

-3.56 

2.69 

11 

.092 

.111 

.093 

-0.47 

2.68 

12 

.144 

.162 

.147 

3.38 

2.67 

13 

.121 

.143 

.124 

1.73 

2.67 

14 

.065 

.079 

.065 

-2.42 

2.68 

15 

.000 

.006 

-.006 

-7.05 

2.71 


We observe from Table 4 that the test results are in agreement with those based on labora- 
tory test except for the 11th and 13th subjects. We also note that for the 15th observation, 27 
= 0.006 and i* = —0.006. This discrepancy may be due to the fact that the corresponding y, 
= 0 and is very close to zero. 

4. Concluding Remarks 

In this article, we have investigated hypothesis testing in the calibration problem. In the uni- 

variate case, we have succeeded in deriving appropriate tests for testing one sided hypotheses 

in the situation of multiple use of the calibration data. The results are illustrated using an 
example. The two sided testing problem is not considered in this article, since tests in this 
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context can be obtained by inverting the multiple use confidence regions that are available in 
the literature. One-sided multiple use confidence regions can certainly be developed and can be 
used for testing the hypotheses in (1.5). The simultaneous tolerance intervals in Mee, Eberhardt 
and Reeve (1991) can be used to obtain conservative multiple use confidence regions, and hence 
conservative multiple use tests, for the two sided testing problem. The one-sided simultaneous 
tolerance intervals in Odeh and Mee (1990) can similarly be used for obtaining conservative 
multiple use tests for the one sided testing problems in (1.5). However, in the present article, 
our goal has been to investigate whether the tolerance interval condition can be used for carrying 
out multiple use hypotheses tests. The numerical results in Section 3 show that this is indeed 
the case. Note that our results are applicable only in the univariate set up and similar results 
are currently not available in the multivariate case. The problem of testing hypothesis in the 
multivariate calibration problem is currently under investigation. 
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